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Mapping to prosody: Not all parentheticals 
are alike 


Abstract: This study investigates the prosody of different types of sentence- 
medial parentheticals in Turkish. The results of a production experiment show 
that clausal parentheticals exhibit cues similar to intonation phrase-level cues 
such as pre-boundary lengthening of the final syllable, longer pauses, and 
higher final rise. Phrasal parentheticals, on the other hand, exhibit cues similar 
to phonological phrase-level cues on both edges. From these results, we con- 
clude that clausal parentheticals are prosodically isolated, supporting the theo- 
ries of syntax-prosody mapping, while XP parentheticals are prosodically inte- 
grated, partially supporting syntax-prosody mapping theories. The latter result 
supports theories that assume XP-to-phonological phrase matching, but not 
those that predict the prosodic isolation with all parentheticals. In this respect, 
Turkish marks constituent-to-constituent matching of syntax and prosody more 
faithfully than the mapping of syntactic isolation. Additionally, mapping of 
pragmatic information is highly favoured in Turkish. Specifically, pragmatically 
isolated parentheticals such as vocatives or interjections are prosodically isolat- 
ed, regardless of their syntactic make-up. We discuss the prosodic structure of 
Turkish parentheticals and propose a representation that favours the recursion 
of certain prosodic category types. 
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1 Introduction 


Prosodic constituency is affected by the input syntactic constituency (Selkirk 
1986, Nespor and Vogel 1986, Truckenbrodt 1995). How prosody mirrors syntax 
is the subject of theories of syntax-prosody mapping, such as the theory of Pro- 
sodic Hierarchy (Nespor and Vogel 1986, Selkirk 1986) and Match theory (Sel- 
kirk 2009, 2011). According to these theories, syntactic constituents match to 
corresponding prosodic category types and these types are hierarchically or- 
dered (1). Clauses map to Intonation phrases (ts), subclausal constituents (i.e. 
XPs) map to Phonological phrases (®s), and lexical words map to Prosodic 
words, (ws). 


(1) Hierarchy of prosodic category types (Selkirk 2009) 


Utterance (v) 


Intonational Phrase (1) 


Phonological Phrase (®) 


Prosodic Word (w) 


In the prosodic hierarchy; an ı is composed of one or more ®s, and a® is com- 
posed of at least one w. The category types are usually distinguished from one 
another by the difference in the strength of boundaries, the level of pitch reset 
across the adjacent prosodic constituents, and the directionality of the prosodic 
heads (e.g. left prominent categories vs. right prominent categories). Additional- 
ly, just like syntactic structures, prosodic structures may be recursive (cf. Ito 
and Mester 2009, 2012, Wagner 2005, Elfner 2012, Kawahara 2012). 

In the Autosegmental-Metrical Model of Intonational Phonology (AM) (cf. 
Pierrehumbert 1980, Ladd 1996), prosodic constituents are identified with refer- 
ence to certain segmental phenomena and metrical cues, such as the presence 
of boundary tones (e.g. T- for ©, T% for ı), the use of lexical/post-lexical pitch 
accents (T*), and/or the presence of down-step on the FO (!T). For example, in 
English, among other patterns, an ı may also be marked by ‘falling intonation’: 
a low-levelled flat FO on its right edge (i.e. t final boundary tone; L%). Such 
marking is usually an indication of the end of a discourse/turn. An t may be 
marked by ‘rising intonation’: a steep rise on its right edge (i.e. t final boundary 
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tone; H%). Coupled with a pause this is an indication of discourse continuation 
(cf. Nespor and Vogel 1986, Selkirk 2005). 

In this paper, we focus on the distinction between t and © in Turkish. Fol- 
lowing AM and Match theory, we investigate whether the prosody of parenthe- 
tical structures in Turkish is isomorphic to their syntax. Match theory predicts 
that parenthetical structures, such as the appositive en yakin arkadasim ‘my 
best friend’ in (2), are parsed as ts since they are isolated from the utterance to 
which they interpolate (hereafter the host). 


(2) Aynur, enyakın arkadas-ım, ben-iparti-ye davetet-me-di. 
Aynur most close friend-1Pposs I-Accparty-DAT invite do-NEG-PST 
‘Aynur, my best friend, did not invite me to the party.’ 


However, some studies, such as Wichmann (2001), Peters (2006), Deh& (2007), 
Kaltenböck (2009), Dehé (2009), and Dehé and Wichmann (2010), show that not 
all cases of parenthetical insertions lead to prosodic isolation from the ı of their 
host. Depending on various factors, such as the length of the inserted item 
(Dehé 2007, 2009; Dehé & Kavalova 2007, and Kaltenböck 2009) or the function 
of the parenthetical (Peters 2006), some parentheticals do not exhibit prosodic 
isolation cues such as language specific ı-formation cues (e.g. t-level boundary 
tone insertion, longer pauses, longer final syllables). The prosodic integration of 
parentheticals poses problems for the theories of strict syntax-prosody map- 
ping, and demonstrates that other properties of language may override the re- 
quirement of mapping syntax into prosody. 

By prosodic isolation, we mean the parsing of a prosodic unit as a category 
above ® (in this paper it is 1). By prosodic integration, we mean the parsing of a 
prosodic unit as a category below ı (in this paper it is ® or potentially w) (Selkirk 
1986, Bolinger 1989, Gussenhoven 2004, Dehé 2007, and Kawahara 2012). 

Focusing on the case of Turkish, we address two related issues. First, we 
examine to what extent the prosodic structure of Turkish validates the predic- 
tions of universal syntax-prosody mapping theories, with respect to t-formation. 
Second, we aim for a better understanding of the cross-linguistic factors that 
lead to prosodic isolation/integration of parentheticals. 


2 Fora syntactic account of such parentheticals in Turkish see Griffiths and Güneş (2014). 

3 For syntactic isolation accounts see e.g. Haegeman (2009), Peterson (1999), Burton-Roberts 
(2006), for semantic isolation accounts see Potts (2005), De Vries (2007), Arnold (2007), and see 
Selkirk (2005) for a Match Theory based account that predicts the prosodic isolation of paren- 
theticals. 
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These issues have previously been addressed in Günes (2014), which dis- 
cusses the results of a production experiment on two parentheticals of Turkish 
(yanılmıyorsam ‘if I am not mistaken’, and bence ‘for me’ or ‘I think’). The re- 
sults described therein demonstrate that (i) no significant difference pertains 
between the final rise of pre-nuclear parentheticals and pre-nuclear arguments, 
(ii) both post-nuclear arguments and post-nuclear parentheticals exhibit low 
and flat FO, (iii) parentheticals are pronounced at similar pitch intervals to ar- 
guments in corresponding positions, and (iv) boundary tones on the edges of 
parentheticals and preceding host constituents are ®-level tones. Based on 
these results, Güneş concludes that yanılmıyorsam and bence are parsed as Ms 
and not ts, and thus display prosodic integration. From this investigation one 
can conclude that ı-formation in Turkish is not dependent solely upon syntactic 
input. 

This paper investigates whether Giines’ conclusions are supported by re- 
sults from an expanded data set. To achieve this, we conducted a production 
experiment to examine the prosodic behaviour of a number of sentence-medial, 
pre-nuclear Turkish parentheticals of different lengths, syntactic forms, and 
pragmatic functions. 

The paper is organized as follows: Section 2 outlines the prosodic constitu- 
ency of Turkish. Section 3 introduces the methodology of the experiment. Sec- 
tion 4 reports the results. Section 5 discusses the findings and section 6 conclu- 
des. 


2 Background on the prosody of Turkish 
Declaratives: ® and ı 


©® and ı are domains above the w that are envisaged for Turkish (Kabak and 
Vogel 2001, Kabak and Revithiodou 2009, Kan 2009, Kamali 2011). Kabak and 
Vogel (2001), the first study on Turkish that depicts a level higher than w in 
terms of Prosodic Structure Theory (cf. Selkirk 1986 et seq., Nespor and Vogel 
1986), portray the ®-level as embodying phrase-level stress (i.e. head promi- 
nence). They observe that head prominence falls on the leftmost w of an ®.* 


4 Note that they associate the head only with the stress-bearing syllable of the leftmost w, and 
not with the entire leftmost w. By instrumental analysis, Kamali (2011) and Giines (2013) ob- 
serve that the overall pitch level of head w (including the unstressed syllables of that w) is 
relatively higher than the overall pitch level of the non-head w (this is called relative levelling of 
pitch register). Therefore, in our view, the head of a ® is not only the syllable but the entire w. 
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Directionality of head prominence, being specific to ®-level constituent for- 
mation, is considered as a diagnostic for the ® in Turkish. In the consequent 
studies (Özge and Bozsahin 2010, Kamali 2011), Ms are also observed to bear 
phrase accents/edge tones that delimit one ® from another (e.g. H- for the right 
edge of Ms). Kan (2009) lists four cues that identify the difference between ts 
and Ọs in Turkish: (i) boundary tones (H- for ® level and H% for level), (ii) 
pauses (if there are any, then shorter across Ms and longer across ts), (iii) head 
prominence (left prominent ®s and right prominent ts), and (iv) final lengthen- 
ing (shorter before ® boundaries, and longer before t boundaries). We employ 
Kan’s diagnostics for the identification of prosodic category types of different 
levels in the prosodic hierarchy here. In addition to these diagnostics, we in- 
cluded the degree of initial lowering (the level of L and L%), which tests for 
pitch reset on the left edges of parentheticals. This diagnostic is added because 
a recent study on Japanese parentheticals (Kawahara 2012) observes that a 
change in the degree of initial tone marks a variation in the organization of 
prosodic structure. 

The presence of accent is lexically contrastive in Turkish (there is only one 
accent and it is a binary one (Kamali 2011: H*L). Words can be accented and 
accentless in Turkish.”° We utilize only accentless ws in our experiment. By 
doing this, we aim to eliminate acoustic variation that arises due to the exist- 
ence of an accent. The prosodic constituency structure of Turkish declarative 
root clauses with accentless ws is illustrated below.’ 


The high-levelled pitch register aligned with the heads of ®s is observed in the data reported 
here as well. 

5 In Turkish most words bear stress on their final syllable (Sezer 1981, Inkelas and Orgun 1998, 
Kabak and Vogel 2001). The words that bear non-final stress are the words that bear accent, 
where the pitch accent falls on the stressed syllable. In the case of finally stressed words, the 
final syllable has stress accent but not pitch accent (Konrot 1981). 

6 That there are accented and accentless words in Turkish has been proposed recently by 
Kamali (2011), who also suggests that, in this regard, Turkish should be considered as a “pitch 
accent language”. The classification of Turkish in terms of its word melody is beyond the scope 
of the current study. 

7 See Kamali (2011) for a similar representation. 
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(3) The t of Turkish declarative root clauses with accentless ws: 


L H- L H- L H ILI L% 

( nucleus )ı l 
(___non-final Jo ( non-final jo ( final Jo ® 
( Jo ( Jo (head Jw ( Jw w 


The tonal variation of constituents is highly dependent on their position. The 
nucleus (the perceptually prominent part of a root clause) is the defining point 
of the intonational template in declaratives. Any constituent that occurs to the 
left of the nucleus (pre-nucleus) bears a rising terminal (H-) with an optional 
lexical accent, and any constituent that occurs to the right of the nucleus (post- 
nucleus) shows a levelled pitch contour. We refer to the ® that contains the 
nucleus as the final-® and all the ®s that linearly precede the nucleus as non- 
final-®s. The only difference between non-final-®s and final-®s is that final-Ds 
do not bear H- on their right edge. The high levelling of FO marks the initial w of 
the final-® as the head that bears a H on its right edge. The second w of the 
final-® begins with a low left edge (L), the level of which is scaled lower relative 
to the L of the first w in the final-®. 

Notice that the L of the second w in the final-® is lower than the L on the 
left edge of the first w of the final-®. This, we believe, is due to the difference in 
the prosodic category that each L belongs to. What we assume is that L on the 
left edges of all ®-initial ws bear a high levelled low left edge tone (L) marking 
the left edges of Ms. On the other hand, any non-initial w in a ® bears a low 
levelled low left edge tone (again annotated with a L) that marks the left edges 
of ws. In this respect, the phonetic difference between the ®-level L and the w- 
level L is that the former bears relatively higher pitch than the latter. In the pre- 
sent paper, only the ®-level L is analysed. 

The resulting high plateau in the final-® is the head of the final-® and also 
the head of the t: it is the nucleus. In this study nucleus is taken as a prosodic 
unit. 

Note that (3) bears a L% right edge t-level boundary. The leftmost t in (4) il- 
lustrates an utterance with comma intonation (a right-edge rise that is higher 
than the rise of the non-final Os): H%. 
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(4) Two consequent ts with comma intonation (H%) with accentless ws 


L H- L H È H% ï H- L H-E L% 
( nucleus )ı ( nucleus J 
( non-final )o ( final Jo (_non-final )o ( final Jo ® 
( Jo (__head  )u ( Jw ( Jo (__head__)u ( Jw w 


In a declarative utterance with comma intonation, the right edge (final syllable) 
of the t is marked with a steep rise. Note that the steepness of the rise in (4) is 
also due to post-nuclear levelling. Specifically, since the final-® is composed of 
more than one w, there is post-nuclear lowering of the pitch register in the area 
after the head of the final-®. The post-nuclear area bears the lowest FO register 
in Turkish declarative utterances (Ozge and Bozsahin 2010). Therefore, any H 
tone that follows this area is expected to be steeper than the rest of the post- 
nuclear area, and the rest of that t. If there is no item following the nucleus, the 
rise is expected to be less abrupt. (5) illustrates a case where the ı ends with the 
nucleus (i.e. where the final-® bears only one w). 


(5) Two consequent ts with comma intonation (H%) with accentless ws and 
single-w final-® 


_ ren m „_— Be a 

L H- L H% Ñ H- L L% 

( nucleus _), ( nucleus ) t 

(_non-final Jo (final )o (__non-final _)e (__ final )o ® 
( 


(Cs (head _)w jo ( head _)o w 


The right edge boundary tone of the first utterance does not bear a steep rise. 
However, the boundary is still interpreted as stronger than the ®-level and is 
marked with an t-level boundary tone (H%). This is due to the existence of other 
t-level edge properties; i.e. lengthening of the final syllable and a following 
pause. In the experiment reported in this paper, we considered this variation 
and controlled the data accordingly. 
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3 Method 


In this experiment, the prosodic properties of parentheticals and the surround- 
ing host (non-)constituents were compared to corresponding ® and ı bounda- 
ries within and across root clauses that do not contain parentheticals. Pre- 
boundary lengthening of syllables, FO values on the constituent edges (bounda- 
ry tones), and the distribution of the pauses were investigated. 


3.1 Stimuli 


Stimuli were drawn from two sets: the control and test utterances. The control 
contained two sub-sets: one for ® boundaries and the other for ı boundaries. All 
target utterances exclusively contained accentless words. To the highest possi- 
ble degree, these words were also devoid of obstruents, especially in the areas 
targeted in our analysis (i.e. the edges of the analysed items). The total number 
of sentences processed was 704 (176 sentences x 4 speakers). 


3.1.1 Control set 


There were two groups in the control set: the ®-boundary group and the ı- 
boundary group. For the ® boundary, there were 48 target utterances in total 
(VP-adverbial case: 14, argument case: 34). For the t boundary, there were 45 
cases in total (single-w final-®: 21, multi-w final-®: 24). In total, there were 372 
boundaries present in the control set (93 boundaries x 4 speakers). 


3.1.1.1 ® boundaries in the control 

Only those elements that were parsed as non-final-Ps were included for ®- 
boundaries in the control. Because the non-final-® that immediately precedes 
the nucleus bears a higher H- then the preceding non-final-Os (Günes 2014; 
İpek and Jun 2013), we included in the control only those non-final-®s that are 
non-adjacent to the final-® in our analysis. We did this in order to eliminate 
unnecessary variation for the statistical analysis. 

For the FO analysis of the pre-nuclear final rise, we employed two kinds of 
non-final-®s: (i) ®s with single word, and (ii) Ds with two or more words. Un- 
like the single-w ®s (6a), multi-w ®s display a ‘head vs. non-head’ distinction 
(6b). Heads of Bs in Turkish are marked by a higher pitch register, and a H tone 
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on the right edge, while the non-head bears FO lowering (cf. Kamali 2011, Giines 
2013). 


(6) a. Control set for © boundaries (single-w argument) (®-ar) ® 


L H- L H- L H 18 L% 
(Dallama-lar)non-to (yalan-lar-1)non-t (ilgili-ler-e duyur-uyor):o 
jerk-PL lie-PL-ACC associate-PL-DAT give.out-PROG 


‘The jerks give out some lies to the officers.’ 


b. Control set for ® boundaries (multi-w arguments) (®-ar) 


(Genelde)nonto (yeni görevli-ler)non-o (yarım marul-lar-ı)non-o (yayla-lar-a)non-o 
Usually new staff-PL half lettuce-PL-Acc field-PL-DAT 

L H L L% 

(iade ed-iyor):o 


return make-PROG 


‘Usually, the new staff sends the half lettuces back to the fields.’ 


Parentheticals are frequently analysed as syntactically adjoined to their host 
(Corver & Thiersch 2001, Potts 2005, De Vries 2006, Van Maastricht 2011, and 
Griffiths this volume). Bearing this in mind, VP-adverbs such as in (7) were also 
included in the control, as instances of regular adjunction. 


8 Rounded rectangles mark the analysed constituents. 
9 To ensure semantic isolation pertains between the adjoined parenthetical and its host, a sui 
generis adjunction mechanism must typically be posited (cf. De Vries 2008, 2012). 
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(7) a. Control set for © boundaries (single-w VP-adverb) (B-app) 


1% 
(Amir-ler)non-to (ileri-ler-de)nonto (ağa-lar-1)non--o (ön-e al-ıyor)eo 
chief-PL ahead-PL-LOC landlord-PL-Acc front-DAT take-PROG 
‘Ahead of us, the chiefs let the landlords join the cue from the front.’ 
b. Control set for ® boundaries (multi-w VP-adverb) (®-app) 
L H- L HL L% 


(Yuva-lar)nono (öğle-ye doSru)non-o (yavru-lar-1)nont0 (uyku-ya yatır-ıyor)eo 
nursery-PL noon-DAT towards baby-PL-ACC sleep-DAT lie-PROG 
‘Around noon, the nurseries put the babies to sleep.’ 


3.1.1.2 t boundaries in the control 

We utilized strings of consecutive clauses for t boundaries in the control. Only 
clauses that bear continuation intonation (comma rise; H%) were included. To 
analyse the FO values of the right edge boundaries and control the amount of 
the final rise, a setting similar to that in (6) and (7) was employed. In this case, 
the final-® of each non-final clause was either single-w (i.e. the nucleus alone) 
or multi-w (i.e. the nucleus and a following low-levelled FO). Single-w and mul- 
ti-w final-®s enjoyed equal representation in the control. The two conditions are 
illustrated in (8). 


(8) a. Control set for t (t-n) boundaries (single-w final-®) 


L H- L H L L% 
[(Münire)non-to (menemen-e);o): [(Neriman)non-to (yaglama-ya dadan-1yor):o): 
Miinire omelet-DAT, Neriman yaglama-DAT pick.at-PROG 


‘Münire has been picking at the omelet; Neriman the pancake.’ 


Bereitgestellt von | De Gruyter / TCS 
Angemeldet 
Heruntergeladen am | 16.10.19 15:50 


Mapping to prosody — 297 


b. Control set for ı (ı-pn) boundaries (multi-w final-®) 


[(Emine)non-to (yavru-yu göm-dü)eoll  [(Miraye)nonto (yer-ler-I ovala-dı)eol. 


Emine puppy-Acc bury-PST Miraye floor-PL-ACC scrub-PST 
_ TARS -\ 
L H- È H L L% 
[(Neriman)non-£- (helva-yı —- yoSur-du)ro): 
Neriman halvah-acc knead-PST 


‘Emine buried the puppy. Miraye scrubbed the floor. Neriman kneaded the halvah.’ 


3.1.2 Test set 


The test set was split into verbal and XP parentheticals.'° The test set was com- 
posed solely of sentence-medial pre-nuclear parentheticals that were not in the 
immediately pre-nuclear position. Single-w parentheticals and multi-w paren- 
theticals were equally represented. 


3.1.2.1 Verbal parentheticals 

Verbal parentheticals are clausal parentheticals that contain a finite verb. This 
test set comprised 31 parentheticals. It contains four subsets, each containing at 
least 5 utterances. These subsets are: 


(9) a. Comment clauses (com) (8 utterances) 
Biinyamin, biiyiik oranla dogrudur, yegenini Meray’a ayarliyormus. 
Biinyamin, it is probably true, matches his cousin up with Meray. 


b. Finite non-restrictive relative clauses (finnon) (5 utterances) 
Maymunlar, ki yabanidirler, liderlerine boyun egerler. 
Monkeys, and they are wild, obey their leaders. 


10 Originally, the test set contained a third group of parentheticals: ‘in-between’ parentheti- 
cals. This group consisted of amalgamations, i.e. those constructions like John is going to I think 
it is Chicago on Sunday (Kluck 2011). Since it is unclear where the syntactic and prosodic 
boundaries lie in such structures, they were not included in the analysis. They were instead 
used as fillers. Prosodic phrasing of these forms requires future inquiry. 
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c. Finite adverb-like parentheticals (adfin) (13 utterances) 
Maymunlar, yıllar ilerler, yavruları yuvalara gömerler. 
Monkeys, (and) the years pass by, bury the infants in their dens. 


d. Interruptions (inter) (5 utterances) 
Mamayı, yana doğru eğilin, bebeğe biberonla veriyorlar. 
They give the food, lay down on your side, to the baby in a nursing bottle. 


In (9), verbal parentheticals are divided according to their surface structure. 
Verbal parentheticals can also be split across another dimension, viz. their 
pragmatic function. This split is binary — either verbal parentheticals are prag- 
matically integrated into their host (9a-c), or they are pragmatically isolated 
from their host (9d). Unlike pragmatically integrated parentheticals, which 
contribute to the discourse structure containing their host, pragmatically isolat- 
ed parentheticals contribute only to the situational context in which the host is 
uttered. To illustrate, consider (9d), where the speaker is a doctor, the addressee 
is a patient in the setting of a hospital. Here, the interruption is relevant to the 
situation (i.e. the doctor’s examination of the patient) but not to the topic of 
conversation (i.e. how to feed a baby). Among pragmatically integrated paren- 
theticals, comment clauses, e.g. (9a), exhibit a different semantic-pragmatic 
relation when compared to other pragmatically integrated clausal parentheti- 
cals, e.g. (9b) and (9c). They are even more integrated than the other parenthe- 
ticals (Reis 2000, Asher 2000, Dehé and Wichmann 2010 and Dehé to 2014). This 
difference is due to the fact that comment clauses present the speaker’s mental 
disposition about the truth validity of the host proposition. In this sense, their 
“communicative value is roughly equivalent to an adverbial...” (Reis 2000:9). 
“A true integrated reading (= Reinhart’s ‘speaker-oriented’ reading)” (ibid.: 12) 
of comment clauses results in prosodic integration across languages (Reis 2000 
for German, Reinhart 1983, Dehé and Wichmann 2010, and Dehé 2014 for Eng- 
lish). 


3.1.2.2 XP parentheticals 

Different types of subclausal constituents comprised the test set of XP parenthe- 
ticals (or phrasal parentheticals). 26 XP parentheticals were used for this test set. 
This set contains four subsets, with each subset containing at least 5 utterances. 
These subsets are: 


Bereitgestellt von | De Gruyter / TCS 
Angemeldet 
Heruntergeladen am | 16.10.19 15:50 


Mapping to prosody — 299 


(10) a. Mitigative adverbials (admit) (5 utterances) 
Memurlar, anladığım kadarıyla, alanlarda bela arıyor. 
‘The officers, to my understanding, look for a trouble in the arenas.’ 


b. Nominal appositives (appo) (6 utterances) 
Emir’i, yeğenimi, arabayla oyuna götürüyorlar. 
‘They always take Emir, my cousin, to the play by car.’ 


c. Post-positional (peripheral) adverbials (adper) (10 utterances) 
Alevler, alınan önlemlere ragmen, yalılıları dumana bogdu. 
‘The flames, despite the precautions taken, have smothered the residents 
in fog.’ 


d. Vocatives (voca) (5 utterances) 
Koyunları, degerli yöremizin yerlileri, agillarina yenilerle yollarız. 
‘We send the cows, (you) dear locals of our region, to the barns with the 
new ones.’ 


Similarly to (9), XP parentheticals can split according to their level of pragmatic 
integration. (10a-c) are pragmatically integrated, while the vocatives such as the 
one in (10d) are isolated. 


3.2 Data elicitation 


Target utterances were presented to the participants as MS PowerPoint slides on 
a screen. Each slide contained three sections. First was the context section, 
where an imaginary context was described so that the participant could draw on 
her knowledge of information structure to produce the target utterance in a 
more natural manner. Second was the elicitation question, which fixed the posi- 
tion of the nucleus in the target utterance. Third was the target utterance. All 
the standard orthographic conventions of the Turkish language were employed 
in the context section, the eliciting question, and the target utterance. The ut- 
terances were elicited by means of a role-playing game. The subjects read the 
context, the question and the target answer (not out loud). Then, the experi- 
menter (the first author) read aloud the eliciting question. The subjects then 
employed the target sentence as an answer to the experimenter’s question. This 
procedure was repeated throughout the experiment to elicit the same target 
whenever necessary (in cases of extraneous interruption, etc.). 
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3.3 Speakers and the recording procedure 


Four female speakers of standard Turkish participated in the experiment. All 
speakers were university graduates. At the time of the recording, the speakers 
were aged 32, 32, 37, and 55 respectively from speaker one to four. All speakers 
except the first were monolingual speakers of Turkish. The first speaker was 
recorded in a sound-proof audio studio with an external microphone using the 
software program Adobe Audition 3.01, at the University of Groningen. The 
other three speakers were recorded with an Olympus digital voice recorder (WS- 
812). The recordings of the last three speakers took place in rooms with minimal 
background noise. At the time of recording, only the experimenter and the par- 
ticipant occupied the room. Before the experiment began, four sets of training 
slides were presented to ensure that the participants were familiar with the 
experimental procedure. The training material was repeated multiple times if 
necessary. The experiment lasted approximately 45 minutes. 


3.4 Data processing and measurement 


The sound files of each speaker were processed using Adobe Audition 3.01. The 
amplitude values of the sound were normalized and the background noise was 
eliminated (via ‘noise reduction’) to filter out any potential non-speech sound 
interference. After the noise reduction, each target utterance was extracted and 
transferred to PRAAT 5.3.02 (Boersma & Weenink 2011). Using PRAAT, all octave 
jumps were eliminated. For all speakers, the pitch interval was kept constant 
(ceiling 400Hz, floor 75Hz). In the statistical analysis of the FO, only the semi- 
tone values were used (semitones re 100Hz). All utterances were parsed to their 
syllable, word and sentence boundaries. The parsing procedure was carried out 
in three steps: first all syllable, word, and utterance boundaries were manually 
parsed in PRAAT by a non-Turkish speaker, and then verified by a native speak- 
er of Turkish (the first author). The detailed control of the syllable boundaries 
was undertaken with close reference to the spectrograms, formant values, and 
waveforms. All word and sentence boundaries were then manually re-aligned 
with the corresponding syllable boundaries, two weeks after the second step. 
Pauses (if any) were also parsed. 
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3.5 Statistical analyses 


Duration and FO values of all utterances were analysed in R (R Development 
Core Team 1993-2011). We used linear mixed-effect models for analysing the 
data. All analyses were performed using Ime4 package (Bates et al. 2013). In the 
presentation of the analyses that follow we present fixed effect parameters of 
each model with associated standard error (SE) and t-statistic. All models we 
report are zero-intercept models. As a result the estimates of the model parame- 
ters correspond to the expected value of the response variable for the respective 
group of the explanatory variable. Depending on the model, we also include one 
or more random effects to account for the variation due to factors such as 
speaker, relevant syllable type and the length of the item. 

Speaker variation is a commonly observed source of variation in the results. 
Therefore, speaker is included as a random effect in all the reported models. 
Another potential source of variation is the length of the items tested (the num- 
ber of syllables per item). Length is also included as a random effect in all mod- 
els reported here to avoid the length-related variation in the results. In all re- 
sults reported below, we use a five-level categorical variable representing the 
length of the phrase where items with one, two, and three syllables form the 
first three category, and items with four to six (inclusive) syllables and items 
with seven or more syllables form the last two categories. The category decision 
was based on inspection of effect of the length on syllable durations in the en- 
tire data set. 

Finally, dissimilarity in duration persists between open and closed sylla- 
bles. To avoid any effect of the syllable type, we include syllable type as a ran- 
dom effect in all models whereby the response is the syllable duration (i.e. the 
cases of final lengthening). 

For all cues, we only report results from ‘intercept-only’ random effect.'' We 
report standard errors and t-scores for the fixed effects, and the estimated 
standard deviation of the random effects. We also present point estimates of cue 
values (e.g. final syllable duration) and plus and minus one standard error in- 
terval around the estimate for each category graphically. 


11 In our experiments inclusion of random slopes neither improved the model fit nor affected 
the parameter estimates substantially. Hence, we present the intercept-only random effects for 
the sake of simplicity and consistency. 
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4 Results 


4.1 Final Lengthening 


Segmental makeup of syllables may have an effect on the results when we gen- 
erate the estimates of final syllable durations. Particularly, open (in our data 
.CV) and closed (in our data .CVC) syllable values may be consistently different 
across different prosodic category types. The results show that this is indeed the 
case when we compare the mean values of the open and closed final syllables of 
ts and Ọs in the control. Accordingly, the mean duration of t-final open syllables 
is 181ms, while the mean duration of ®-final open syllables is 123ms. Similarly, 
the mean duration of t-final closed syllables is found to be 242ms, while the 
mean duration of ®-final closed syllables is 206ms. As a result, although there 
is a consistent difference between the duration of open and closed syllables 
(where the former is shorter than the latter), the categorical variation persists: 
i.e. t-final syllables are longer than ®-final syllables, regardless of the syllable 
type. Particularly, open t-final syllables are shorter than open ®-final syllables 
and closed t-final syllables are shorter than closed ®-final syllables. When de- 
signing the stimuli, the segmental properties and the syllable type distribution 
of the final syllables are not controlled, and in the analysis, syllable type is 
included as a random factor. 


4.1.1 Parenthetical vs. ® and ı 


Final syllable duration of the parentheticals is compared to the final syllable 
duration of the ® and ı in the control. Figure 1 presents the final syllable dura- 
tion averages for all types. The solid horizontal line denotes the mean word- 
final syllable duration for all words, both in control and target sentences. The 
dashed horizontal line represents the overall mean of all syllable durations. 
Shading indicates the type of the phrase: white is t, darker grey is ®, darkest 
grey is verbal parenthetical, and light grey is XP parenthetical. 
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Fig. 1: Final syllable duration of the parentheticals, ® and t boundaries 


As Figure 1 shows, the averages of the final syllable duration for some parenthe- 
ticals are closer to t, the final syllable duration of which is higher than average 
word-final syllable duration. While ®-final syllable duration values are around 
average, D-app (low adverbs) exhibits markedly shorter final syllables.'? A simi- 
lar variation is observed in parenthetical type admit (mitigative adverbials). 
Figure 1 also indicates a difference between verbal and XP parentheticals (the 
former having longer final syllables). Among the verbal parenthetical group, 
inter (interjections) bears substantially longer final syllables; more than the t- 
final syllable duration. Among the XP parenthetical group, voca (vocatives) 


12 The results of other tests, which are reported in the current paper, indicate that low adverbs 
in the ®-control group are different from the arguments of the same group. We observed that, 
in all cues, the D-app cases show weaker boundaries than the ®-ar cases. That Turkish prosod- 
ic structure marks low adverbs differently from the arguments is a very interesting observation. 
Yet, a more elaborate investigation that specifically focuses on this distinction is essential for 
comprehensive conclusions. 
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exhibits the longest final syllables, with an equal (if not higher) t-final syllable 
duration.” 

Figure 2 and Table 1 present the parameter estimates of a linear mixed-effect 
model fit with random intercepts for length (as the total number of syllables of 
each chunk), speaker and syllable type, whereby the predictor is the general 
phrase types ‘parenthetical’, ‘intonational phrase’ and ‘phonological phrase’. 


L — = 
& ----@--- 
Par.(verb) —— 


a ee 


Fig. 2: Estimates of the final syllable durations (in seconds) of ı, ®, verbal parenthetical and XP 
parenthetical 


13 As reported here and in the following sub-sections, the groups voca and inter behave differ- 
ently in comparison to their group-mates. Particularly, the box-plots for all cues report that 
voca is substantially different from the other XP parentheticals and inter is substantially differ- 
ent from the verbal parentheticals. This variation could be a source of misleading results for 
linear mixed-effect models, as all estimates are calculated relatively. For this reason, the to- 
kens of voca and inter were not included in any of the linear mixed-effect models. 

For all cues, the cases of voca and inter are only discussed based on the average values report- 
ed in box-plots. 
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Fixed effects 


Estimate SE tvalue 
l 0.216 0.034 6.281 
(0) 0.164 0.034 4.776 
Par.(verbal) 0.184 0.035 5:331 
Par.(XP) 0.183 0.035 5.290 


Random effects s.d. 


Length Intercept 0.0024 
Speaker Intercept 0.0082 
Syl. Type Intercept 0.0480 


Table 1: Estimates of the final syllable durations (in seconds) of t, ®, verbal and XP 
parenthetical 


Intonational phrases show the longest final syllables. Phonological phrases 
exhibit the shortest final syllables, and two groups of parentheticals are in- 
between, being slightly closer to ® than t. There does not seem to be a differ- 
ence in the final syllable length values of the verbal and XP parentheticals — 
both exhibit ®-like durations. 

Figure 3 and Table 2 present a model fit to the same data with a more de- 
tailed grouping of the parentheticals and phonological phrases, which reveals 
some differences between the verbal and XP parentheticals. 


L 

-ar 
-app 
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appo 
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f T | 
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Fig. 3: Estimates of the final syllable durations with all parenthetical type 
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Estimate SE t value 
Fixed effects 
l 0.218 0.035 6.216 
®-ar 0.168 0.035 4.801 
®-app 0.147 0.035 4.171 
Finnon 0.204 0.037 5.542 
Adfin 0.190 0.036 5.348 
Com 0.159 0.036 4.408 
Adper 0.180 0.036 5.060 
Appo 0.191 0.036 5.283 
Admit 0.179 0.037 4.866 
Random effects s.d. 
Length Intercept 0.0046 


Speaker Intercept 0.0082 
Syl. Type Intercept 0.0487 


Table 2: Estimates of the final syllable durations with all parenthetical types 


With respect to the control, a large difference pertains between t-final and ®- 
final syllable durations. t-final syllables are long and ®-final syllables are short. 
The final syllables of some parentheticals are dissimilar in duration to both t- 
final and ®-final syllables: they are neither long nor short (finnon and adfin 
among the verbal parentheticals, and appo among the XP parentheticals). 
Among the XP parentheticals, the final syllable durations of adper and admit is 
very close to the ® category, as well as com, which is in the group of verbal 
parentheticals. Nevertheless, the final syllable of verbal parentheticals endures 
for slightly longer than its XP parenthetical counterpart. Thus, the ordering of 
the tested items in terms of their final syllable duration is as follows: 


(11) t> verbal parenthetical ~ XP parenthetical > © 


4.1.2 Pre-parenthetical host boundary vs. ® and ı 


To see if the part of the host that linearly precedes the parenthetical insertion is 
isolated or not, the final syllable duration of the pre-parenthetical host item is 
compared to the final syllable duration of the ® and t in the control data. Figure 
4 compares pre-parenthetical host-final syllable duration classified for each 
parenthetical type with the last syllables of ® and ı. 
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Fig. 4: Final syllable duration of ® and ı boundaries and the pre-parenthetical host-clause part 
for all parenthetical types 


The pre-parenthetical final syllable duration is as long as or longer than ı 
boundaries in the case of verbal parentheticals. The host part that precedes XP 
parentheticals is shorter than the overall duration of verbal parenthetical cases 
and closer to ®-final syllables. As before, speaker and syllable type and the 
length of the analysed item cause systematic variation in the results presented 
in Figure 4. Therefore, we create a model where length, speaker and syllable 
type are included as random effects. Figure 5 and Table 3 present the results. 
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Fig. 5: Estimates of the final syllable durations of t, ®, and pre-parenthetical host syllable 
durations before verbal parenthetical and XP parenthetical 


Estimate SE tvalue 

Fixed effects 

l 0.216 0.033 6.548 

® 0.164 0.033 5.010 

Par.(verbal) 0.199 0.033 6.018 

Par.(XP) 0.188 0.033 5.664 
Random effects s.d. 

Length Intercept 0.0000 


Speaker Intercept 0.0059 
Syl. Type Intercept 0.0461 


Table 3: Estimates of the final syllable durations of ı, ®, and pre-parenthetical host syllable 
durations before verbal parenthetical and XP parenthetical 


Figure 5 and Table 3 provide results closer to those observed with final syllable 
duration. The duration of pre-parenthetical syllables lies between the duration 
of the final syllables of ® and ı. Again, a difference in duration between verbal 
and XP parentheticals is observed - this time in a more pronounced way. Pre- 
verbal parenthetical host-final syllable duration is closer to t-final syllable dura- 
tion, while pre-XP parenthetical host-final syllable duration is closer to the ®- 


final syllable duration. 


The parameter estimates of the detailed model including all the sub-groups 


of the control and the test are presented in Figure 6 and Table 4. 
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Fig. 6: Estimates of the final syllable durations of ı, ® and pre-parenthetical host syllable 
durations for detailed parenthetical types 


Fixed effects 
l 
Q-ar 
®-app 
finnon 
adfin 
com 
Adper 
Appo 
Admit 
Random effects s.d. 
Length Intercept 
Speaker 


Syl. type 


Intercept 
Intercept 


Estimate 


0.216 
0.168 
0.149 
0.226 
0.209 
0.160 
0.174 
0.223 
0.169 


0.0004 
0.0060 
0.0489 


SE 


0.035 
0.035 
0.035 
0.036 
0.035 
0.036 
0.035 
0.036 
0.036 


tvalue 


6.180 
4.831 
4.231 
6.253 
5.939 
4.494 
4.904 
6.191 

4.655 


Table 4: Estimates of the final syllable durations of t, ® and pre-parenthetical host syllable 
durations for detailed parenthetical types 


The results of the detailed model in Figure 6 and Table 4 support the earlier 
model’s conclusions with some exceptions. Although pre-verbal parenthetical 
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syllable duration is longer than pre-XP parenthetical duration, and closer to ı- 
final syllable duration, the host part that immediately precedes com has the 
shortest difference, being closer to XP parenthetical and ®-final syllable dura- 
tion. A similar behaviour is observed for the host part that precedes XP- 
parenthetical appo, which is, this time, closer to t-final syllable duration. 

In sum, on average, the final syllable duration values before parentheticals 
are in between t-final and ®-final syllable durations. The ordering of the tested 
items in terms of their final syllable duration is shown below: 


(12) ı = pre-verbal-par. host > pre-XP-par. host > ® 


4.2 Pauses 


Number and duration of the pauses before and after the parentheticals are com- 
pared to the number and duration of the pauses before and after Ms and ts in the 
control. Figure 4 presents a logarithm of the duration of the pauses before (left) 
and after (right) the indicated phrase types. The graph only presents the dura- 
tions where a pause occurred. The rate of pauses after indicated types is ana- 
lysed and presented separately. 
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Fig. 7: Duration (in log scale) ofthe pauses before (left) and after (right) each type 
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The first impression we get from Figure 7 is that pauses occur both before and 
after all phrases of interest. For all parentheticals, the general tendency is that 
the pauses before the tested items are shorter than the pauses that come after. 
The pauses that occur before and after the interjections are the longest in dura- 
tion. The pauses that come after the appositives are the longest among XP par- 
entheticals. Pauses surrounding ts are considerably longer than the pauses that 
surround © types. 

Pause duration results are in line with the final syllable duration values; ı > 
parenthetical > ®. Parenthetical types show a large internal variation. Within 
parentheticals there is a verbal parenthetical > XP parenthetical ordering, espe- 
cially in the cases of pauses that follow the parentheticals. Excluding the set of 
inter, the pauses that come before the parentheticals seem to be closer in dura- 
tion to the pauses that come before ® types. 

We fit two more models as before, this time predicting the pauses on the 
sides from the phrase types. We include speaker and the item length as the ran- 
dom variables. The model in Figure 8 and Table 5 estimates the pause durations 
in the occurrences before the four main groups. 


Par.(verb) 1 -- 
Par.(XP) = > 


0.00 0.05 0.10 0.15 


Fig. 8: Estimates of the duration of the pauses that occur before ı, ®, verbal and 
XP parenthetical 


Bereitgestellt von | De Gruyter / TCS 
Angemeldet 
Heruntergeladen am | 16.10.19 15:50 


312 — Güliz Güneş and Çağrı Cöltekin 


Estimate SE t value 

Fixed effects 

l 0.139 0.011 12.583 

0) 0.013 0.009 1.423 

Par.(verbal) 0.038 0.011 3.399 

Par.(XP) 0.020 0.012 1.695 
Random effects s.d. 

Length Intercept 0.0051 


Speaker Intercept 0.0089 


Table 5: Estimates of the duration of the pauses that occur before ı, ®, verbal and XP parenthe- 
tical 


The model fit on the pauses that are observed before the phrases show that 
pauses that follow t are remarkably longer than the others. Pre-t pause duration 
is followed by pre-verbal parenthetical pause duration, which is followed by the 
duration of pauses that come before XP parentheticals. Pre-® pause duration is 
the shortest. 

The model presented in Figure 9 and Table 6 below estimates of pause dura- 


tion in the occurrences after ı, ®, verbal parenthetical and XP parenthetical. 


Par.(verb) 
Par.(XP) 


0.00 0.05 0.10 0.15 


Fig. 9: Estimates of the duration of the pauses that occur after ı, ®, verbal and XP parenthetical 
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Estimate SE t value 
Fixed effects 
l 0.140 0.009 15.382 
® 0.010 0.007 1.420 
Par.(verbal) 0.068 0.012 5.636 
Par.(XP) 0.036 0.013 2.867 


Random effects s.d. 
Length Intercept 0.0000 
Speaker Intercept 0.0066 


Table 6: Estimates of the duration of the pauses that occur after ı, ®, verbal and 
XP parenthetical 


Pauses that occur on the right edge of the analysed items support the previously 
attested order, i.e. ı > verbal parenthetical > XP parenthetical > ®. This time the 
differences are clearer, and the estimates of the parameters are more certain. 
The results of this model show that pauses that occur after the parentheticals 
tested are longer than the pauses that occur before them. For the sake of com- 
pletion, Figure 10 and Table 7 present estimates of detailed model parameters 
for pause durations before and after the phrase types. 


t = tr oe 
-ar "or -ar = © 
®-app mes -app 7 
finnon =o finnon ~ —|— 
adfin = adfin 4 e. 
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adper =p adper 4 —— 
appo —o— appo — =. 
admit ~ —e— admit — —-e— 
= a | rs rs a | 
0.00 0.05 0.10 0.15 0.00 0.05 0.10 0.15 


Fig. 10: Estimates of the duration of the pauses that occur before (left) and after (right) all types 
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Before After 
Estimate SE tvalue Estimate SE t value 

Fixed effects 

l 0.139 0.011 12.315 0.140 0.009 15.473 

®-ar 0.011 0.011 1.040 0.009 0.008 1.158 

®-app 0.016 0.013 1.215 0.015 0.014 1.017 

finnon 0.030 0.022 1.397 0.136 0.026 5.142 

adfin 0.050 0.015 3.422 0.046 0.016 2.825 

com 0.020 0.018 1.114 0.063 0.022 2.886 

adper 0.006 0.016 0.378 0.014 0.018 0.801 

appo 0.053 0.020 2.645 0.083 0.022 3.829 

admit 0.005 0.022 0.242 0.015 0.026 0.560 
Random effects s.d. 

Length Intercept 0.0061 0.140 

Speaker Intercept 0.0089 0.009 


Table 7: Estimates of the duration of the pauses that occur before (left) and after (right) all 
types 


The detailed models support the above conclusions, with the previously ob- 
served outliers. Now, we focus on the frequency of pauses rather than their 
duration. Figure 11 presents the number of pauses before and after each phrase 
type. 
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Fig. 11: The rate of pauses before (left) and after (right) constituents for all types 


The results of the distribution of the pauses that occur before and after the items 
analysed showed an ordering similar to the one of the results of the duration of 
the pauses. Among the parenthetical set, inter and voca exhibit the highest 
number of pauses on their right edge, which is at least as high as the number of 
pauses that come after t boundaries. When we compare the verbal parentheti- 
cals to XP parentheticals (excluding inter and voca), we see that verbal paren- 
theticals exhibit higher occurrences of pauses, which were also longer. Pauses 
on both sides of the ® boundaries are shorter and fewer in number. The pauses 
surrounding the XP parentheticals are closer to ® boundaries in duration and 
distribution. However, in contrast to their right edge, appositives exhibit a 
higher amount of pauses on their left edge. The ordering is shown in (13): 


(13) ı > verbal parenthetical > XP parenthetical > ® 


In addition to the ordering we observed in (13), another conclusion of this sub- 
section is that the pauses that come before each parenthetical type are shorter 
and fewer in number, whereas the pauses that come after each type are longer 
and more frequent. 
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4.3 Final rise 


The degree of rise of the final syllables of s and ts in the control data is com- 
pared to the degree of rise of the final syllables of each parenthetical in the test 
data. In the case of t-final rise, the set ‘’ is divided into two groups; (i) on nucle- 
us-ı (i-n), where the t-final item itself is the nucleus and the t ends with a high 
plateau (8a), and (ii) post-nucleus t boundary (i-pn), where the t-final ® is more 
than one word and the right edge comes after the nucleus (8b). 
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Fig. 12: The degree of rise in the final syllables 


The box plot on Figure 12 represents the difference between the mean pitch 
value of the final and penultimate syllables for all types. The first impression we 
get from these graphs is again similar to the previous cues. The set of Os shows 
the least degree of final rise. Within the set of parentheticals, verbal parentheti- 
cals seem to show a higher rise in comparison to XP parentheticals. Again, inter 
exhibits a very different trend than the other verbal parentheticals. Specifically, 
it exhibits the lowest degree of pitch difference, which indicates in most cases 
there is no rise but fall indicating a low boundary, L%. This becomes more visi- 
ble when the individual FO contours of inter are examined. The other exception- 
al case for its category was voca in the set of XP parentheticals. Similar to the 
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results attained from the previous cues, voca exhibits a variation in its group 
and bears a higher final rise. Excluding the type inter, the final rise of ts and 
verbal parentheticals are alike. Similarly, excluding the type voca, final rise of 
Ọs and XP parentheticals are closer to each other. 

We observe that there is some variation in the case of ı based on whether ı 
shares its last word/syllable with the last syllable of the nucleus (i-n), or wheth- 
er it follows the nucleus (i-pn). Accordingly, the former bears a smaller magni- 
tude of final rise in comparison to the latter. This variation is expected consider- 
ing the transmission from the low levelled pitch level of the post nuclear area to 
a H% boundary which is triggered by comma intonation. The same variation is 
also observed in the cases of ®-final rise. Specifically, the phonological phrases 
that bear only one word (®-ar-w) show a smaller magnitude of final rise, where- 
as phonological phrases that contain more than one word (@-ar-w) exhibit a 
higher rise. This variation is due to the difference in the pitch register across the 
head and non-head parts of the Ms that contain more than one word. 

We also fit a model that takes length as well as the speaker variation as ran- 
dom effects (against systematic pitch-range variation due to speakers). First, a 
model that only distinguishes t, ® and two main parenthetical subdivisions is 
presented in Figure 13 and Table 8. 


L i 
© -------- 
Par.(verb) oa se 


Par.(XP) - — A 


0.0 1.0 2.0 3.0 


Fig. 13: Estimates of the final FO rise of ı, ®, verbal parenthetical and XP parenthetical 
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Estimate SE t value 

Fixed effects 

l 2.106 0.557 3.782 

0) 1.700 0.519 3.275 

Par.(verbal) 2.469 0.577 4.277 

Par.(XP) 1.556 0.576 2.700 
Random effects s.d. 

Length Intercept 0.6159 

Speaker Intercept 0.7905 


Table 8: Estimates of the final FO rise of ı, ®, verbal parenthetical and XP parenthetical 


All phrase types indicate a rise from penultimate syllable to final syllable. Ver- 
bal parentheticals show the highest rise, followed by t, XP parentheticals and ®, 
with a rather small difference between the XP parentheticals and ®. This sup- 
ports a ‘verbal par > ı > XP par >= ®’ ordering. The model below shows the case 
of detailed grouping: 


L —e— 
-ar u 
-app —— 
finnon ee 
adfin —— 
com —— 
adper —— 
appo nn 


Fig. 14: Estimates ofthe amount ofthe final FO rise for alltypes 
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Estimate SE t value 

Fixed effects 

l 2.255 0.531 4.246 

®-ar 1.811 0.503 3.601 

®-app 1.250 0.584 2.140 

Finnon 2.850 0.822 3.467 

Adfin 2.721 0.645 4.221 

Com 1.931 0.716 2.698 

Adper 2.199 0.645 3.409 

Appo 1.663 0.712 2:337 

Admit 0.117 0.816 0.144 
Random effects s.d. 

Length Intercept 0.5007 

Speaker Intercept 0.7907 


Table 9: Estimates of the amount of the final FO rise for all types 


The main trend is similar to the results above. Although the majority of verbal 
parentheticals exhibit high rise (which is higher than the ı condition), comment 
clauses diverge and exhibit a rise that is closer to ® condition. The ordering of 
the four major groups in terms of the final rise is shown in (14): 


(14) verbal par > ı > XP par = © 


4.4 Initial lowering 


In the control, the mean FO of the initial syllable of the ®, where the L is ob- 
served, is subtracted from the mean FO of the final syllable of the preceding ®, 
where the H- is observed. The same procedure is also applied to t boundaries. 
Note that, for this condition, only the non-initial Ds and ts are calculated - i.e. in 
® cases, sentence initial syllables are excluded. In ı cases, only the t-initial syl- 
lables of the non-initial sentences are analysed. 

In the test set, the mean FO of the initial syllable of the parentheticals is 
subtracted from the mean FO of the final syllable of the host (non-)constituent 
that immediately precedes the parenthetical. The graph in Figure 15 presents 
the mean difference of initial lowering. 
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Fig. 15: The difference (in semitones) of the mean FO of the initial syllables, and the mean FO of 
the final syllables of the items that immediately precede them 


The graph shows a degree of lowering for all groups. However, there does not 
seem to be a consistent difference across all types. In fact, the figure shows that 
for all types, the initial fall values are centred between 1 and 2 semitones for all 
cases. 

Figure 16 and Table 10 present the results we obtain when we investigate in- 
itial lowering with a model with four groups (also including the ı-n/ı-pn distinc- 
tion): 


t-n 
ı-pn 

® 
Par.(verb) 


_. 
—e— 
i 
— 
Par.(XP) 7 Or 


-1.0 0.0 1.0 2.0 


Fig. 16: Estimates of initial lowering for ı-n, t-pn, verbal parenthetical, XP parenthetical, and © 
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Estimate SE tvalue 

Fixed effects 

tn 0.290 0.328 0.886 

pn -0.811 0.333 -2.438 

(0) 1.561 0.246 6.349 

Par.(verbal) 1.525 0.304 5.024 

Par.(XP) 1.308 0.323 4.052 
Random effects s.d. 

Length Intercept 0.0000 

Speaker Intercept 0.4084 


Table 10: Estimates of initial lowering for ı-n, ı-pn, verbal parenthetical, XP parenthetical, and ® 


While the base level (i-n) shows the least differences in pitch, the post-nuclear ı 
(i-pn) shows the most. The initial syllable of the ı that follows t-pn exhibits high- 
er FO than the final syllable of the preceding t, which bears a H% tone. This may 
be due to two reasons: Either the rise on the right edge of ı-pn cases is not as 
high, or the ts that succeed the ts that end with a post-nuclear area start higher 
(e.g. higher than the right edge of t-pn cases) and not low. ®-initial FO level is 
almost the same as verbal and XP parentheticals. Bs, verbal parentheticals and 
XP parentheticals exhibit substantial lowering on their left edges, which is con- 
siderably lower than both of the ı-initial cases. The last model below shows the 
case when we apply the model to all subgroups. 
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in 7 = 
ipn 4 —e 
®-ar-w ~ _. 
@-ar-ws — —o— 
®-app-w 7 —._— 
®-app-ws ~ —— 
adfin — —o 
com — —e— 
finnon — TE Zu 
admit — —— 
adper — _—— 
appo ~ -e 
CELTI EEJ 


Fig. 17: Estimates of the initial lowering of all sub-types 


Estimate SE t value 
Fixed effects 
n 0.290 0.328 0.883 
-pn -0.811 0.334 -2.432 
Ọ-ar-w 1.598 0.264 6.048 
®-ar-ws 1.412 0.435 3.250 
®-app-w 1.399 0.480 2.913 
®-app-ws 1.664 0.480 3.464 
adfin 1.428 0.379 3.769 
com 1.667 0.455 3.661 
finnon 1.553 0.554 2.805 
admit 1.647 0.554 2.975 
adper 1.192 0.417 2.857 
appo 1.218 0.512 2.378 
Random effects s.d. 
Length Intercept 0.0000 
Speaker Intercept 0.4080 


Table 11: Estimates of the initial lowering of all sub-types 


We observe that the difference between ® types is not substantial, and that all 
subtypes of parentheticals resemble the ® condition. The information we gather 
from initial lowering is not sufficient to attribute these tendencies to left edge 
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marking, since we cannot be sure if the difference is a result of the variation in 
the low start on the left edge of the constituents, or the higher/lower end on the 
right edge of the preceding items of the corresponding constituents. Consider- 
ing the initial lowering results, one cannot conclude that there is a pitch reset in 
the case of parentheticals. Nor can we conclude that left edge FO is employed to 
mark a difference between the left edges of Ms and ts in Turkish. Precisely how 
to draw generalizations from these results is not yet clear to us, and hence must 
remain an issue for further research. 


4.5 An overview of the results 


The tables below summarize the results. Table 12 lists the properties of paren- 
theticals in two main groups: verbal parentheticals and XP parentheticals.!* 
Table 13 presents the results in terms of the category (® or ı) to which the sub- 
types of parentheticals are closer.!5 


Type of parenthetical 
Type of measure Type of edge Verbal XP 
Final syllable duration Right edge <,>® <,>® 
Pre-parenthetical syllable duration Left edge =I <,=® 
Pause length before Left edge =I <si> aG 
Pause length after Right edge zı <1, > Ọ 
The amount of final rise Right edge >I <,=® 


The amount of initial lowering Left edge 


Table 12: Summary of the results with main groups of parentheticals 


14 We thank the anonymous reviewer for the suggestion of this table for a summary of the 
findings. 
15 Any sub-type of parentheticals that do not pattern with the other members of its group is 
italicized. 
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Type of measure Phonological Phrase Intonational Phrase 
3 Final rise admit, appo, adper, com finnon, adfin 
v 
E Final syllable duration admit, appo, adper, com finnon, adfin 
v 
= Pause length after admit, adper finnon, adfin, com, 
GA appo 

Pre-par. final syllable duration admit, adper, com finnon, adfin, appo 
v 
on 
g 3 Pause length before admit, adper, com finnon, adfin, appo 
zo 
4 Initial lowering 


Table 13: Summary of the results with subgroups of parentheticals 


5 Discussion 


The orderings of the parenthetical types with respect to t and ® boundaries in 
section 4 encode two important observations. First, verbal parentheticals exhib- 
it ‘stronger’ boundaries that are similar to t edges. Second, XP parentheticals 
exhibit ‘weaker’ boundaries that are similar, if not identical to, ®© edges. This 
dichotomy both supports and contradicts the theories of syntax-prosody map- 
ping. That finite clausal parentheticals are prosodically parsed closer to intona- 
tion phrases supports the idea of clause-to-t mapping. It also supports the as- 
sumption that parenthetical structures are prosodically isolated. However, XP 
parentheticals exhibit the properties closer to phonological phrase-hood (i.e. 
they exhibit prosodic integration). This indicates either (i) that there is not such 
a tight mapping between syntax and prosody in Turkish, or (ii) that the map- 
ping constraints are ranked and can, in certain environments, override one 
another. 

Mapping appears to be overridden in two ways in Turkish. First, Turkish 
parentheticals are shown to exhibit instances of both prosodic isolation and 
integration. This indicates that the prosodic realization of syntactic isolation is 
not highly ranked. More important is to prosodically mark the clause-hood and 
phrase-hood of the target syntactic structures. Thus, regardless of whether they 
are extra-sentential or not, root clauses are parsed as ts, and structures that do 
not exhibit the properties of root clause-hood tend to be parsed as Os. 

Ranked higher than constituent-to-constituent mapping (and consequently 
syntactic isolation) is pragmatic relation. If a parenthetical is pragmatically 
isolated then it is parsed as an ı, regardless of its syntactic type or level of syn- 
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tactic isolation. This was observed with vocatives!® and interjections, which 
exhibit longer final syllables, and longer pauses on their right edge. The final 
syllable of the host part that immediately precedes them is also longer than XP 
parentheticals in the test set, and Ms in the control set. Final rise values also 
provide evidence of their t-hood. While vocatives exhibit the highest magnitude 
of final rise (H%), interjections exhibited the lowest values of final pitch rise, 
which is even lower than ®s’ condition. We claim that this is due to a low tone 
that marks the right edge of the intonation phrase (L%). It is not surprising to 
observe falling intonation instead of a rising comma intonation in such cases. 
Since interjections are not related to the content of the discourse, and since they 
are, in fact, interruptions not only to the host syntactic structures but also to the 
host discourse structure, they do not necessitate the use of comma intonation, 
which typically marks a continuation of the ongoing discourse. Therefore, we 
conclude that pragmatically isolated parentheticals such as clausal interjections 
and vocatives, the latter of which are XPs, are parsed as ts regardless of their 
syntactic properties. 

In a similar fashion semantic/pragmatic properties of comment clauses 
override their syntactic properties in mapping to prosody, yielding them to be 
parsed as ®s and not ıs. In this case, we observe that when a parenthetical pre- 
sents a speaker’s stand towards the truth of the entire host proposition, then it 
is prosodically integrated regardless of its syntactic makeup. Giines (2013) ob- 
serves that yanılmıyorsam ‘if I am not mistaken’, a clausal parenthetical, is pro- 
sodically integrated. This is not surprising considering its function as a com- 
ment clause. In fact, based on the syntactic data and their pragmatic behaviour, 
Griffiths and Giines (2014) conclude that regardless of their clausal syntax and 
undominated (parenthetical) syntactic behaviour, comment clauses in Turkish 
do not exhibit root clausal properties. In this regard, they suggest, these struc- 
tures should be analysed on par with non-clausal parentheticals. Our findings 
support the syntactic-pragmatic analysis of these authors. This phenomenon 
that we observe in Turkish comment clauses is also observed in other languages 
(cf. Reinhart 1983, Reis 2000, Dehé and Wichmann 2010, Dehé 2014). 

Another parenthetical type that does not seem to follow the generalization 
made in this study is nominal appositives. We observe that appositives exhibit 
stronger isolation cues especially on their left edge. This is not surprising con- 


16 See Göksel and Pöchtrager (2013) for various prosodic realizations of a wider range of 
vocatives in Turkish. One should note that the types of vocatives investigated by these authors 
are not the same as the ones that are analysed in this paper. These authors investigate non- 
interpolating vocatives that convey meanings such as surprise, calling, checking for identity, 
and so on (ibid.:92). 
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sidering the linear position of appositives and the way they modify their an- 
chors. The appositives that are tested in this study are constituent-modifying 
parentheticals, which must immediately linearly follow their anchors (Griffiths 
and Güneş 2014).'’ Functionally, appositives provide an alternative referent for 
their anchor, while syntactically, they and their anchors share the same maxi- 
mal projection (ibid.). Their linear position and syntactic-semantic similarity 
with the anchor forces a stronger prosodic boundary in the juncture of the ap- 
positive and its anchor, which acts as a parser that separates these syntactic- 
semantic likes. One can envisage this as a prosodic strategy that ensures that 
Richard’s (2001) Distinctness Condition on Linearization, or some constraint 
similar to it, is satisfied. However, the presence of a stronger left edge boundary 
does not engender total prosodic isolation of appositives, as they do not exhibit 
t-level properties on their right edge. That they are parsed as Os is further sup- 
ported by Griffiths and Günes (2014). These authors note that, while prosodical- 
ly isolated parentheticals such as ki clauses (9b) cannot occupy the nucleus of 
their hosts, appositives can. Similarly, while parentheticals that are parsed as ts 
cannot occupy post-nuclear area of their host-ı, appositives can. In this respect, 
appositives exhibit the same prosodic properties as their anchors, and as other 
arguments of their host. Therefore, we consider appositives to be parsed as Ọs, 
but with a more pronounced left edge. 

Our results show that pauses are employed to mark the edges of both ® and 
t. However, the duration of the pauses displays variation. While verbal paren- 
theticals exhibit similar values to the edges of ts, XP parentheticals are found to 
be closer to the pauses surrounding ®s. Note that, in all cases, a pause that 
precedes a parenthetical is shorter than the pause that follows it. 

This fact may be analysed in two ways. First, although their prosodic cate- 
gory type matches with their syntax (XP to ® and clause to 1), parentheticals 
inherit some properties of their paratactic nature. The resulting prosodic form 
may be represented as a hybrid prosodic category type, which has slightly 
stronger boundaries than their already existing correlates, call it “By.” for XP 
parentheticals, and “tpar” for verbal parentheticals, where the hierarchical order 
of types is D < par < tpar < 1. Alternatively, instead of postulating a new category 
type, one can analyse this observation as a case of prosodic recursion, where 
the prosodic unit created by recursion is marked by a greater degree of promi- 
nence than a unit not created by recursion on its right edge. To illustrate the 


17 Their correlate structure in English is constituent modifying appositives; i.e. namely- 
parentheticals. In Turkish, instead of namely an Arabic loan yani is optionally used. For a 
detailed syntactic analysis of yani parentheticals see Griffiths and Giines (2014). 
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latter analysis, we postulate the structure in (15) for verbal parentheticals, inter- 
jections and vocatives.!® 


(15) 
Utterance 
t-non-min l 
min min host 
host verbal-parentheticals 
vocatives 


interjections 


The structure in (15) is a recursive prosodic structure in which the non-terminal 
prosodic type t-non-min displays a ‘more prominent’ right edge. This promi- 
nence is realized on the right edge of the parenthetical, which is the right- 
branching daughter of t-non-min. The structural position of the parenthetical in 
(15) thus explains why pause durations before the parenthetical are shorter than 
those that occur after the parenthetical: the former marks the edge of an ‘atom- 
ic’ prosodic unit, while the latter marks the edge of a prosodic unit built from 
self-similar units (in this case, the self-similar units are ts). 

Longer pauses are not reserved for marking verbal parentheticals. XP par- 
entheticals also exhibit pauses on their right edge, yet these pauses are not as 
long as the pauses observed in the t boundary condition, even though they were 
longer than the pauses that linearly precede XP parentheticals. In fact, the 
pauses observed after XP parentheticals are longer, but still closer to the dura- 
tion of the pauses that follow ® boundaries. Keeping in mind that the XP paren- 
theticals are also similar to the ® condition in terms of final syllable lengthen- 
ing, final rise, and pre-parenthetical host-final syllable durations, we claim that 
sentence-medial XP parentheticals are not the immediate daughters of any ı 
(neither a t-non-min, nor a t-min). For XP-parentheticals, we postulate the struc- 
ture in (16). 


18 t-non-min refers to any ı that dominates another ı. ı-min refers to any t that does not domi- 
nate another ı. See Elfner (2012) for cross-linguistic evidence that necessitates the use of mini- 
mal and non-minimal projections. 
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(16) l 
A 
®-non-min D 
ern. Ei 
®-min -min host 
host XP-par. 


Like (15), (16) is a recursive structure. Again, recursion explains the disparate 
durations of the pauses on each side. The only difference between (15) and (16) 
is that, in (16), the non-terminal unit created by recursion is a ®, whereas in (15) 
it is an t. Thus, we expect - and do - observe that the right edge of ®-non-min 
displays the properties of a ®-boundary, albeit one that is more prominent than 
the boundary observed on an ‘atomic’ ® on its right edge. 

The argument propounded above accords with the idea of recursive prosod- 
ic levels, where the more recursive layers there are, the stronger the boundaries 
are marked (Kawahara 2012; Itö and Mester 2009, 2012). What is novel about our 
argument is the claim that recursion is encoded in only pause duration, and 
that other cues, together with the duration of pauses, are employed only to 
mark a prosodic unit as a w or Dor. 

Note that a non-recursive model cannot explain the data adequately since it 
would predict equal boundary strength on both edges of any category. 


6 Conclusion 


In conclusion, verbal parentheticals exhibit t-level properties on both edges. XP 
parentheticals, on the other hand, exhibit phonological phrase-level properties 
on both edges. Clausal parentheticals are prosodically isolated, supporting the 
theories of syntax-prosody mapping, while XP parentheticals are prosodically 
integrated, partially supporting syntax-prosody mapping theories. The latter 
result supports theories that assume XP-to-phonological phrase matching, but 
not those that predict the prosodic isolation with parentheticals. We conclude 
that Turkish marks constituent-to-constituent matching of syntax and prosody 
more faithfully than the mapping of syntactic isolation. Additionally, mapping 
of pragmatic isolation is highly favoured. Specifically, pragmatically isolated 
parentheticals such as vocatives or interjections are prosodically isolated, re- 
gardless of their syntactic make-up. In this sense our findings are on par with 
the findings of Giines (2013). We discussed the prosodic structure of Turkish 
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parentheticals and proposed a representation that favours the recursion of vari- 
ous prosodic category types. Both types of parentheticals are found to exhibit 
relatively longer pauses on their right edges. To account for the unequal distri- 
bution and duration of the pauses, we have proposed a recursive prosodic rep- 
resentation. Overall, our experiment has revealed various aspects of the prosody 
of parenthetical insertions in Turkish. However, our study is exploratory and 
leaves several questions for future inquiry. Additionally, the results reported 
here were based on the controlled productions of four speakers; for a more 
elaborate account of the parentheticals in Turkish, experiments with more 
speakers should be carried out, as well as the analyses of spontaneous speech. 
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